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2 LOW DENSITY PARITY CHECK CODES 1 

1 Introduction 

Low density parity check codes or LDPC codes were discovered by Gallager Q in 1962. 
Because of the lack of sufficient computing power at the time, these codes were largely ignored until 
recently. With the recent discovery of turbo codes and their iterative decoding techniques, there 
has been renewed interest in LDPC codes, which can also be decoded iteratively. It turns out that 
in many cases LDPC codes perform even better than turbo codes in achieving low bit error rates 
for a fixed signal-to-noise ratio. 

The problem with LDPC codes is that they are usually very difficult to encode. In 
Kou, Lin, and Fossorier address this problem by constructing LDPC codes based on finite 
geometries. These codes turn out to be cyclic codes, so they are very easy to encode. 

Since LDPC codes by definition have sparse parity check matrices, we consider the 
possibility that quantum CSS codes can be constructed from them. It is hoped that these sparse 
matrices will lead to more fault tolerant decoding techniques, since fewer computations will produce 
less error. 

In the next section we will define LDPC codes and present a simple decoding algorithm. 
Section 3 will define the finite geometries used in the codes of Kou, Lin, and Fossorier. Section 4 
will define these codes and some generalizations. The final section will give a CSS code construction 
based on a helpful suggestion from Shu Lin. 



2 Low Density Parity Check Codes 

We now define low density parity check (LDPC) codes. 

Definition 2.1. An LDPC code is a binary linear code whose parity check matrix H has the 
following properties: 

1. Each row consists of p ones. 

2. Each column consists of 7 ones. 

3. The number of ones in common between any 2 columns, denoted A, is no greater than 1. 

4. Both p and 7 are small compared to the length of the code. 
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As an example consider the code with parity check matrix 



/ HI 

000 

000 
000 
000 



H 



10000 
1111 
00000 
00000 
00000 



000000 
000000 
111100 
0000 11 
000000 



000000 \ 
000000 
000000 
110000 
001111 



100 
010 
00 1 
000 
000 



01000 
00 100 

000 10 

1 0000 
00001 



100010 
010000 
000001 
00 1000 
000 100 



000000 
001000 
000 100 
100010 
010001 



100 
010 
00 1 
000 

\ ooo 



00 100 

000 10 
00001 

1 0000 
01000 



000 100 
00 1000 
000010 
100001 
010000 



000 100 

10000 
000010 
001000 

1 00001 ) 



This example appears in |l|] . Note that the 2 bottom sections of H are column permutations of the 
top section. In this case, we have that the length n of the code is 20, p = 4, 7 = 3, and no two 
columns have more than 1 "one" in common. 

Gallager gives 2 iterative decoding algorithms in his paper. The first one is extremely 
simple and involves hard decision bit nipping. The second is a probabilistic soft decision algorithm. 
We will describe the first algorithm. The reader is referred to Q for details. 

The bit flipping algorithm is as follows: 

Step 1. Compute the parity-check equations. If they are all satisfied, stop. 

Step 2. Find the number of unsatisfied parity-check equations for each bit, denoted /, for i = 
0,1, ...,n- 1. 

Step 3. Identify the set S of bits for which /j is above some predetermined threshold. 
Step 4. Flip the bits in S. 

Step 5. Repeat steps 1 to 4 until all of the parity check equations are satisfied or a predetermined 
maximum number of iterations is reached. 



The problem with LDPC codes is that they are very hard to encode, especially when 
the length n is large. Kou, Lin, and Fossorier solve this problem by constructing an LDPC code 
which is actually cyclic. It is very easy and efficient to encode cyclic codes. 
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3 Finite Geometries 

We now discuss the finite geometries used in the construction of Kou, et. al. More 
information about finite geometries can be found in Q. 

Definition 3.1. A finite projective geometry consists of a finite set f2 of points p,q, ... and a col- 
lection of subsets L, M, ... of f2 called lines satisfying the following axioms. (If p G L, we say that 
p lies on L or L passes through p.) 

i. There is a unique line denoted (pq) passing through any 2 distinct points p and q. 

ii. Every line contains at least 3 points. 

iii. If distinct lines L and M have a common point p, and if q and r are points of L not equal to 
p, and s and t are points of M not equal to p, then the lines (qt) and (rs) also have a point 
in common. 

iv. For any point p there are at least two lines not containing p, and for any line L there are at 
least two points not on L. 

Definition 3.2. A subspace of the projective geometry f2 is a subset S of 0, such that if p, q are 
distinct points of S, then S contains the line (pq). A hyperplane is a maximal proper subspace of 

n. 

We will also use a second type of finite geometry in our constructions. 

Definition 3.3. A Euclidean geometry is obtained from a projective geometry by deleting the 
points of some fixed hyperplane. 

Definition 3.4. A set T of points in a projective or Euclidean geometry is called independent if 
for every x G T, x does not belong to the smallest subspace which contains T \ {x}. 

For example, any 3 points on a line are not independent. 

Definition 3.5. The dimension of a subspace S of a projective geometry is r — 1, where r is the 
size of the largest independent set of points of S. 

Our first example of a finite geometry will be the projective geometry PG(m, q). 

Definition 3.6. Let GF(q) be a finite field and m > 2. The points of 0, = PG(m, q) are the non- 
zero (m + l)-tuples (ao, ai, a m ) with o« G GF(q) such that (oo, ai, a m ) and (Aao, Aai, Aa m ) 
are considered to be the same point if A is a non-zero element of GF(q). (These are called homo- 
geneous coordinates for the points.) 
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£2 = PG(m, q) is a projective geometry of dimension m. There are q m+l — 1 nonzero 
(m+l)-tuples, and each point appears q-1 times, so has 



,171+1 



1 



q-1 

points. 

The line through the distinct points a = (ao, a±, ...,a m ) and b = (6q, &i, b m ) consists 
of the points 

(Aa + fxb , Aai + fibi, Xa m + iib m ) 

where A and fx are 2 elements of GF(q) which are not both 0. A line contains q + 1 points since 
there are q 2 — 1 choices for A and /x, and each point has q — 1 representations. 

A hyperplane or subspace of dimension m — 1 in PG(m, q) consists of those points 
a = (ao,a±, ...,a m ) which satisfy an equation 

Ao^o + Aiai + ... + A m a m = 

where the Aj are elements of GF(q) which are not all zero. Deleting the hyperplane 

{(Aooo, Aiai, A m a m )|Aoao = 0}, 

gives points which we can take to be of the form (1, a\, a m ). We call this set the Euclidean 
geometry EG(m,q). EG(m,q) has q m points which can be labeled as (ai, ...,a m ). 



4 Construction of LDPC Codes From Finite Geometries 

We now give the constructions of || for LDPC codes derived from the finite geometries 
of the previous section. Our first example is a class of LDPC codes based on EG(2, 2 s ). 

Let a be a primitive element of GF(2 2s ). Each nonzero element of GF(2 2s ) can be 
written in the form a 1 for some i. We can express a % as a 2-tuple Cj), where bi and q are in 
GF{2 S ) and a 1 = bi + Cj<x So GF{2 2s ) can be thought of as the 2-dimensional Euclidean geometry 
EG{2,2 S ). The point = (0,0) is called the origin of EG{2,2 S ). 

Let po,pi be two linearly independent points in EG(2,2 S ). Then the 2 s points of the 
form pq + fdpi with f3 E GF(2 S ) form a line passing through pq. 

I£p2 is linearly independent of both po and p\, then the lines {po + j3p\] and {po + PP2} 
intersect in the point po. Any two lines are identical or have no more than one point in common. 

Given a point po in EG(2, 2 s ), there are 



2 2s 



1 



2 s + 1 



lines intersecting at po, including the line fipo passing through the origin. There are 2 s — 1 lines 
parallel to any given line. (I.e. they have no points in common.) So EG(2,2 S ) has 2 S (2 2 + 1) 
distinct lines. 
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Given a line L and a primitive element a of GF(2 2s ), let 

VL = {vo,Vi, ...,V 2 2s_ 2 ) 

be a binary (2 2s — l)-tuple with Vi = 1 if a* is a point on L, and Vi = otherwise. is called the 
incidence vector of the line L. 

Now form the parity check matrix H for our LDPC code. H is a (2 2s — 1) x (2 2s — 1) 
matrix whose rows are the incidence vectors of the 

2 S (2 S + 1) - (2 s + 1) = 2 2s - 1 

lines in EG(2,2 S ) not passing through the origin. 

The parity check matrix H has the following properties: 

1. Each row has p = 2 s ones since there are 2 s points on any line. 

2. Each column corresponds to a non-origin point in EG(2,2 S ) and has 7 = 2 s lines passing 
through it. (Excluding the line through the origin.) 

3. Any 2 columns have one and only one "1" in common (i.e. A = 1) since given any 2 points, 
there is a unique line passing through these 2 points. 

Definition 4.1. The density of a matrix H is the ratio r of the total number of ones of H to the 
total number of entries in H. 

The parity check matrix for our code has a density 

2 s 

T = 

2 2 *-l 

The density is very small for large values of s. 

It turns out that we can construct H by writing down one row and circularly shifting 
to obtain all of the other rows. So we actually have a cyclic code, which has an easy encoding 
algorithm. 

We will now give an example. In order to understand it, we will need the following 
table of GF(16) generated using the primitive polynomial a 4 + a + 1 = 0. 
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as a 4-tuple 


as a polynomial 


as a power of a 


1000 


1 


1 


0100 


a 


a 


0010 


a 2 


a 2 


0001 


a 3 


a 3 


1100 


1 + a 


a 4 


0110 


a + a 2 


a 5 


0011 


a 2 + a 3 


a y 


1101 


1 + a + a 3 


a v 


1010 


1 + a 2 


a 8 


0101 


a + a 3 


a 9 


1110 


1 + a + a 2 


a lu 


0111 


a + a 2 + a 3 


a 11 


1111 


1 + a + a 2 + a 3 


a 12 


1011 


1 + a 2 + a 3 


a 13 


1001 


1 + Q 3 





Example 4.1. Let s = 2. Let GF{2 2 ^) = GF(W) be generated by the primitive polynomial 
X 4 + X + 1 = 0. This is the 2-dimensional Euclidean geometry EG(2, 2 2 ) over GF{2 2 ) 

Let a be a primitive element in GF(2 2 ^) and let f3 = a 5 . Then {0, l,/3,/3 2 } form the 
subfield GF(2 2 ). 

Every line in EG(2, 2 2 ) consists of 4 points. Letting po = a 14 gives the line {a 14 + rja} 
where r/ ranges over GF{2 2 ). Now the 4 values of rj are 

{0,l,/3,/3 2 } = {0,l,a 5 ,a 10 }. 

The 4 points on the line are now computed using the table. 



a 14 + l a = a i4 + a = 1001 0100 = 1101 = a' 

a 14 + a 5 a = a 14 + a 6 = 1001 © 0011 = 1010 = a 6 

a 14 + a 10 a = a 14 + a 11 = 1001 © 0111 = 1110 = a 
So our line is the set 

{a 7 , a 8 , a 10 , a 14 } 
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with incidence vector (000000011010001). The parity check matrix for our LDPC code is now 



H = 



( 000 
000 
000 
000 
000 
00 1 
1 1 
110 
101 
1 
100 
000 
001 
1 
\ 100 



00001 
000 1 1 
00110 
01101 
11010 
10100 
01000 
1000 1 
000 1 
00 100 
01000 
10000 
00000 
00000 
00000 



1 01 000 1 \ 

0100010 

1000 100 

0001000 

0010000 

0100000 

1000000 

0000000 

0000001 

00000 11 

0000 110 

0001101 

0011010 

110 100 

1 10 1 000 J 



The code based on EG(2, 2 s ) has the following parameters: 

• Length: n = 2 2s — 1. 

• Number of parity bits: n — k = 3 s — 1. 

• Dimension: k = 2 2s — 3 s . 

• Minimum Distance: d = 2 s + 1, so the code can correct 2 S_1 errors. 

Example 4.2. Let s = 7. We have an LDPC code based on EG(2,2 7 ). 

This is a (16383,14197) code with minimum distance 129, so it can correct 64 errors. 

The parity check matrix H has p = 7 = 128, A = 1, and density r=. 007813 

The rate R = k/n of this code is 14197/16383 w .867 If we increase s, the rate of the 

resulting code quickly approaches 1, so these codes are extremely efficient. 



We now describe a family of LDPC codes based on the projective geometries PG(2, 2 s ). 
Let a be a primitive element of GF(2 3s ). Let 

o3s _ 1 

n = -^-^ = 2 2s + 2 s + l 
If P = a n , then (5 has order 2 s — 1. The elements 0, 1,(3,(3 2 , ...,[3 2S ~ 2 form the subfield 

GF(2 S ). 
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Partition the elements of GF(2 3s ) into n disjoint subsets of the form: 
{a\/3a\/3V,...,/3 2S ~V} with < i < n. 

For each i represent this set as (a 1 ). For any a? € GF(2 3s ), if a? = [3 k a l with < i < n, 
then we represent a 3 by (a 1 ). The n elements of the form {a 1 ) are taken to be the points of PG(2, 2 s ). 

If (a 1 ) and (a 3 ) are 2 distinct points of PG{2, 2 s ), then the line L passing through them 
consists of points of the form {z\a % + Z2a 3 ), where z\ and Z2 are elements of GF(2 S ) at least one of 
which is nonzero. Since (z\a l + Z2O- 3 ) and {fi k z\a % + j3 k Z2a 3 ) are the same point, the line L consists 
of 

(2 s ) 2 - 1 

= 2 s + 1 points. 

2 s - 1 

Let (a m ) be a point which is not on the line (z±a l + Z2O1 3 ). Then the lines (zia 1 + Z2OL 3 ) 
and (zia m + z 2 a 3 ) intersect at the point {a 3 ). The number of lines in PG(2, 2 s ) intersecting at the 
point (a 3 ) is 

2 2s - 1 

= 2 s + 1 



There are 2 2s + 2 s + 1 distinct lines in PG(2, 2 s ). 

Given a line L, let = (vo, v±, u n _i) be a binary n-tuple with = 1 if (a*) is a 
point of L, and Vi = otherwise. Then is the incidence vector of the line L. 

We now can form an LDPC code whose parity check matrix H is a (2 2s + 2 s + 1) x 
(2 2s + 2 s + 1) matrix whose rows are the incidence vectors of the 2 2s + 2 s + 1 lines in PG(2, 2 s ). 

The parity check matrix H has the following properties: 

1. Each row has p = 2 s + 1 ones since there are 2 s + 1 points on any line. 

2. Each column corresponds to a point in PG(2, 2 s ), and 7 = 2 s + 1, since any point has 2 s + 1 
lines passing through it. 

3. Any 2 columns have one and only one "1" in common (i.e. A = 1) since given any 2 points, 
there is a unique line passing through these 2 points. 

The density of H is 

2 s + 1 



2 2s + 2 s + 1 ' 



a very small number for large values of s. 

As in the case of the code based on EG{2, 2 s ), we can construct H by writing down one 
row and circularly shifting to obtain all of the other rows. So in this case, we also have a cyclic 
code. 

The code based on PG(2, 2 s ) has the following parameters: 
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• Length: n = 2 2s + 2 s + 1. 

• Number of parity bits: n — k = 3 s + 1. 

• Dimension: k = 2 2s + 2 s — 3 s . 

• Minimum Distance: d = 2 s + 2, so the code can correct 2 S_1 errors. 

Example 4.3. Let s = 7. We have an LDPC code based on PG(2,2 7 ). This is a (16513,14325) 
code with minimum distance 130, so it can correct 64 errors. The parity check matrix H has 
p = 7 = 129, A = 1, and density r = .007812. The rate of this code is R = .867, comparable to the 
code based on EG(2,2 7 ). 

We will now briefly describe some generalizations given in M. They involve puncturing, 
extensions, or higher dimensional geometries. 

We first look at puncturing. Consider a LDPC code based on EG(2,2 S ). Choose a line 
L in EG(2, 2 s ) and remove the columns of H corresponding to the 2 s points in L. We now have a 
matrix with 2 2s — 2 s — 1 columns and an all zero row. Removing this row gives a (2 2s — 2) x (2 2s — 2 s — 1) 
parity check matrix. 

The new parity check matrix has 7 = 2 s ones in each column. The rows have 2 s or 
2 s — 1 ones depending on whether the lines intersect the original line L. We still have that any 2 
columns have exactly one "1" in common. 

The corresponding code is called an irregular LDPC code, since not every row has the 
same weight. 

We can also remove multiple parallel lines to get even shorter codes or puncture LDPC 
codes based on PG(2,2 S ). 

We can extend our LDPC codes by means of column splitting. Given an LDPC code 
of length n with parity check matrix H, we create a new code of length qn. Our new parity check 
matrix H ex t is formed by replacing each column of H by q columns with 2 < q < 2 s for a code 
based on EG{2, 2 s ) or 2 < q < 2 s + 1 for a code based on PG{2, 2 s ). 

As an example, for a code based on EG(2, 2 s ), write 2 s = 75 + 6 with < b < q. Each 
column is the same length as the original and b of them contain 7 + 1 ones, while q — b of them 
contain 7 ones. The ones are put into the columns in a rotating fashion. We illustrate this technique 
with the following example: 

Example 4.4. Let s = 2, so 2 s = 4, and let q = 3. We then have 4 = 37 + 6 so 7 = 1 and 6=1. 
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10 



So the column 



/ 1 \ 




1 
1 



1 




1 
1 




V 1 / 



becomes 



/ 100 \ 
000 
000 
010 
00 1 
000 
000 
100 
000 
000 
1 
00 1 
000 
000 

V loo/ 



Extension gives a (2 — 1) x q(2 — 1) parity check matrix H ex t with the following 



properties: 



1. Each row has weight 2 s . 

2. Each column has weight [—J ° r |_~J + 1- 

3. Any 2 columns have at most one "1" in common. 

We can construct extensions of codes based on PG(2, 2 s ) in a similar manner. Extending 
the codes increases the code rate and improves performance. Note that puncturing and extension 
can be used in combination. 

Finally, we mention that we can generalize all of the codes we have described for 
EG(2,2 S ) and PG(2,2 S ) to codes based on the higher dimensional geometries EG(m,2 s ) and 
PG(m,2 s ) for m > 2. See || for the details. 



5 Quantum LDPC Codes 

Quantum codes arise in a natural way from classical codes. We examined the possi- 
bility of forming a quantum LDPC code based on the finite geometry construction. The quantum 
construction we used was the well known Calderbank-Shor-Steane or CSS codes. These codes are 
described in many places in the quantum computing literature. We will use the definition found in 



Definition 5.1. Suppose C\ and C2 are [n, k\] and [n, fo] classical linear codes such that C2 C C\ 
and C\ and C% both correct t errors. We will define an [n, k\ — A^] quantum code CSS(C\,Ci) 
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capable of correcting errors on t qubits. We call this the CSS code of C\ over C 2 . The construction 
is as follows: If x is a codeword in C\ define the quantum state |x + C2) by 

\x + c 2 ) = —== J2 \ x + y)- 

V 1^2 1 y£C2 

Here \C 2 \ denotes the number of codewords in C 2 and x + y denotes bitwise addition modulo 2. 
The quantum code CSS(Ci,C 2 ) is defined to be the vector space spanned by the states \x + C 2 ) 
for all igCj. Note that \x + C 2 )=\z + C 2 ) if and only if x and z lie in the same coset of C 2 in C\. 
The number of cosets of C 2 in C\ is | Ci | / 1 C 2 1 , so CSS(Ci,C 2 ) is an [n,ki — k 2 ] quantum code. 

We can think of C\ as correcting the bit flip errors and as correcting the phase 
errors. (See for an explanation of why this works.) 

In order to construct a quantum version of a LDPC code, we needed to find families of 
these codes in which codes of the same length nest in a natural way, and whose duals are easy to 
describe. The finite geometry codes are cyclic, so they have easy to describe duals. The problem 
was in finding a way to nest these codes. 

In a private conversation, Professor Shu Lin, one of the coauthors of Q, suggested 
splitting rows of the parity check matrix in a manner similar to the column splitting extension 
technique we described in section 4. This leads to a code with a larger null space and hence a 
smaller code. The code produced is still a cyclic code, so it is still easy to find its dual. 

The technique is best illustrated with an example. 

Example 5.1. Consider the LDPC code given in Example 4.1. The code was a [15, 7] cyclic code. 
We gave the parity check matrix H in the example. Now split the rows of H using q = 2 to produce 
a 30 x 15 matrix H ex t. As an example, the first row 

( 000000011010001) 

of H becomes the first 2 rows 

/ 000000010010000\ 
V 000000001000001 J 

of H ex t. Using Mathematica, we row reduced the resulting matrix and determined that the new 
code has a check polynomial h{x) = x 3 + 1 and that the row space of H has dimension 12, so the 
new code has dimension 3. 

Now let C\ be the original code and C 2 be the code generated by row splitting. We 
know that C 2 C C\. So we have a CSS code where bit flips are corrected by C\ and phase shifts 
are corrected by C^- The CSS code is a [15, 7 — 3] = [15, 4] quantum code. 

Now C\ is an LDPC code. What can we say about C<^~? Since C 2 has a check polynomial 
h(x) = x 3 + 1, C2 has a generator polynomial g^{x) = x 3 (-^ + 1) = x 3 + 1. (See |J.) So the check 
polynomial for C^ is 

h ± ( x ) = ^1±1 = 1 + X 3 +X G + X 9 + X 12 
X 6 + 1 
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(The division is taken modulo 2.) We then have the parity check matrix 

/ 001001001001001\ 
H = 010010010010010 . 
\ 1 00 100 1 001 00 1 00/ 

This matrix has the following properties: 

1. Each row has p = 5 ones. 

2. Each column has 7 = 1 ones. 

3. Any 2 columns have one and only one "1" in common (i.e. A = 1). 

So C2 is actually an LDPC code. This means that bit flip errors and phase flip errors 
are both corrected by LDPC codes. 

The density of the parity check matrix for is 15/45 ~ .33, which is a little more 
than the density 4/15 « .27 for C\. 



6 Conclusions 

We have shown that we can use the finite geometry construction of LDPC codes to 
construct CSS codes. The CSS codes fix both bit flip and phase shift errors with LDPC codes. 
There is still a need to develop a general theory describing these codes. It is also hoped that 
the relatively simple decoding algorithm for LDPC codes will lead to more fault tolerant decoding 
algorithms for these CSS codes. Our initial example shows that this may be a promising research 
area. 
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